NSF PAR Search | NSF Public Access Repository


GenSQL: A Probabilistic Programming System for Querying Generative Models of Database Tables

https://doi.org/10.1145/3656409

Huot, Mathieu; Ghavami, Matin; Lew, Alexander K; Schaechtle, Ulrich; Freer, Cameron E; Shelby, Zane; Rinard, Martin C; Saad, Feras A; Mansinghka, Vikash K (June 2024, Proceedings of the ACM on Programming Languages)
Hicks, Michael (Ed.)
This article presents GenSQL, a probabilistic programming system for querying probabilistic generative models of database tables. By augmenting SQL with only a few key primitives for querying probabilistic models, GenSQL enables complex Bayesian inference workflows to be concisely implemented. GenSQL’s query planner rests on a unified programmatic interface for interacting with probabilistic models of tabular data, which makes it possible to use models written in a variety of probabilistic programming languages that are tailored to specific workflows. Probabilistic models may be automatically learned via probabilistic program synthesis, hand-designed, or a combination of both. GenSQL is formalized using a novel type system and denotational semantics, which together enable us to establish proofs that precisely characterize its soundness guarantees. We evaluate our system on two real-world case studies, anomaly detection in clinical trials and conditional synthetic data generation for a virtual wet lab, and show that GenSQL captures the complexity of the data more accurately than common baselines. We also show that the declarative syntax in GenSQL is more concise and less error-prone than several alternatives. Finally, GenSQL delivers a 1.7-6.8x speedup over its closest competitor on a representative benchmark set and runs in comparable time to hand-written code, in part due to its reusable optimizations and code specialization.
Full Text Available
Program Synthesis Guided Reinforcement Learning for Partially Observed Environments

Yang, Yichen; Inala, Jeevana Priya; Bastani, Osbert; Pu, Yewen; Solar-Lezama, Armando; Rinard, Martin (January 2021, Advances in neural information processing systems)

A key challenge for reinforcement learning is solving long-horizon planning problems. Recent work has leveraged programs to guide reinforcement learning in these settings. However, these approaches impose a high manual burden on the user, who must provide a guiding program for every new task. Partially observed environments further complicate the programming task because the program must implement a strategy that correctly, and ideally optimally, handles every possible configuration of the hidden regions of the environment. We propose a new approach, model predictive program synthesis (MPPS), that uses program synthesis to automatically generate the guiding programs. It trains a generative model to predict the unobserved portions of the world, and then synthesizes a program based on samples from this model in a way that is robust to its uncertainty. In our experiments, we show that our approach significantly outperforms non-program-guided approaches on a set of challenging benchmarks, including a 2D Minecraft-inspired environment where the agent must complete a complex sequence of subtasks to achieve its goal, and achieves performance similar to that of using handcrafted programs to guide the agent. Our results demonstrate that our approach can obtain the benefits of program-guided reinforcement learning without requiring the user to provide a new guiding program for every new task.
Full Text Available
Program Synthesis Guided Reinforcement Learning for Partially Observed Environments

Yang, Yichen David; Inala, Jeevana Priya; Bastani, Osbert; Pu, Yewen; Solar-Lezama, Armando; Rinard, Martin (January 2021, Advances in neural information processing systems)

Full Text Available
Program Synthesis Guided Reinforcement Learning for Partially Observed Environments

Yang, Yichen D.; Inala, Jeevana P.; Bastani, Osbert; Pu, Yewen; Solar-Lezama, Armando; Rinard, Martin (January 2021, Advances in neural information processing systems)

Full Text Available
Neurosymbolic Transformers for Multi-Agent Communication

Inala, Jeevana Priya; Yang, Yichen; Paulos, James; Pu, Yewen; Bastani, Osbert; Kumar, Vijay; Rinard, Martin; Solar-Lezama, Armando (December 2020, Advances in neural information processing systems)
Full Text Available
